This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 



IMAGES ARE BEST AVAILABLE COPY. 




BLACK OR VERY BLACK AND WHITE DARK PHOTOS 



• GRAY SCALE DOCUMENTS 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 
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REPLICATING BY THE CLOCK 



Alon Goren and Howard Cedar 

The eukaryotb genome is divided into well-defined DNA regions that are programmed 

to replicate at different times during S phase. Active genes are generally associated with early 

replication, whereas inactive genes replicate late. This expressbn pattern might 

be facilitated by the differential restructuring of chromatin at the time of replication in early 

or late S phase. 



G BANDS 

A characteristic chromosome- 
banding pattern that is shown 
by staining with Giemsa. Light 
and dark G bands differ in their 
molecular and regulatory 
features, such as gene density, 
repetitive sequence elements 
and replication timing. 

ISOCHORES 

Long DNA fragments (>300 16) 
defined by their average G+C 
content Isochores are divided 
into five subfamilies according 
to their G+C composition: from 
G+C poor (-40% G+C) to G+C 
rich (-55-60% G+C). 
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Animal-cell DNA is arranged in distinct chromosomal 
bands that undergo replication in a temporally pro- 
grammed way during S phase 1 . Although the reason for 
this organization is not completely understood, there is 
no question that it has functional significance. A great 
deal of research has shown that, both at a regional and 
sequence-specific level, there is a striking and straight- 
forward correlation between replication timing and gene 
expression. Most active genes replicate early (in the first 
half) in S phase, whereas many inactive gene sequences 
replicate late in S phase 2 . Despite this intriguing relation- 
ship, the mechanism that connects replication timing 
and gene expression has not yet been fully elucidated 

One of the most interesting suggestions for explaining 
this phenomenon is the 'window of opportunity' 
model 3 ' 4 . According to this model, the transcriptional 
competence can be influenced at the time of replication, 
when the chromatin structure is disrupted and then 
repackaged on newly synthesized DNA. One possibility 
is that genes undergoing replication in early S phase are 
exposed to factors that are required for the formation of 
active transcription complexes, whereas genes that 
replicate in late S phase experience a different nuclear 
environment, which is more conducive for the genera- 
tion of repressive structures. This model relies on three 
components: first, an independent mechanism for con- 
trolling replication timing; second, the susceptibility of 
chromatin to restructuring during replication; and 
third, variations in nuclear composition as a function of 
progression through S phase. 

Replication time zones 

Animal cells are characterized by a striking division of 
the genome into replication time zones. These are 



best visualized by labelling the cells for various time 
intervals with bromodeoxyuridine (BrdU) and exam- 
ining their incorporation pattern in metaphase chro- 
mosomes 5 (FIG. l). High-resolution analysis of 
prometaphase chromosomes indicates that the aver- 
age length of each time zone is approximately 1 Mb 6 - 7 . 
Early-replicating regions map directly to light g bands, 
which represent the G+C-rich isochores 8 , whereas the 
late-replicating regions map to dark G bands 1 , which 
generally contain A+T-rich DNA 9 * 10 . So, replication 
timing seems to reflect a fundamental aspect of the 
genome sequence structure. 

In contrast to bacteria, which replicate their entire 
genome using a single origin, animal cells copy their 
DNA by initiating bidirectional synthesis at multiple 
origins that are spaced, on average, - 1 00 kb from each 
other 11 . This indicates that each replication time band 
is made up of several replicons, which must fire in a 
coordinated manner — and it is exactly this pattern 
that is observed when DNA synthesis is visualized by 
autoradiography after labelling with ^-thymidine 12 . A 
similar organization was observed using microarray 
technology to map the full complement of replication 
origins in the yeast Saaharomyces cerevisiae. In this sim- 
ple organism, the genome seems to also be made up of 
small clusters that contain several origins which have 
coordinated replication-time properties 13 . The picture 
that emerges from these studies is that the control of 
replication timing is carried out at the regional level, 
and is executed by the coordinated firing of multiple 
origins. 

The timing of replication has also been determined 
for individual genes, by isolating BrdU-labelled DNA 
from cells at different stages of S phase 14-18 and then 
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Figure 1 1 Replication bands. In a typical cell cycle, S phase 
lasts about 8 h, whereas G2 takes about 2 h. Replication 
bands can be observed by labelling with bromodeoxyuridine 
(BrdU) (green) for various lengths of time and then examining 
only those cells that contain metaphase chromosomes. This 
functions as a form of retroactive synchronization. When 
labelled for 3 h, for example, the BrdU that is observed on 
chromosomes must have been incorporated into DNA during 
the last hour of S phase. Labelling for 5 h shows the DNA that 
was replicated during the last 3 h of S phase. The entire DNA is 
labelled when cells are incubated with BrdU for 1 1 h. BrdU 
labelling is regional and appears in a band-Eke pattern, which 
coincides with the replication time zones. 



Unlike these higher organisms 21 , there is no general 
correlation between replication timing and gene expres- 
sion in S. cerevisiae 1 *, but some repressed genes are 
indeed located in subtelomeric regions that replicate late 
in S phase. The HML and HMR mating-type genes are a 
good example of this phenomenon. Although constitu- 
tive! y silent in their telomeric positions, these same 
sequences are actively expressed when they are copied 
into the early-replicating mating-type (MAT) locus on 
the same chromosome 22 . 

As DNA replication seems to be regulated at the 
regional level, it is interesting to address the question of 
how these topographical patterns are altered for develop- 
mentally regulated genes. This type of molecular-band 
analysis has been carried out using fluorescence in situ 
hybridization (FISH) 20 , whereby specific sequences are 
visualized in interphase nuclei from non-synchronized 
cells by using fluorescent probes. Unreplicated loci 
appear as two single dots in diploid cells, whereas repli- 
cated DNA is characterized by double signals (FIG. 2a). A 
high percentage of double dots in S-phase nuclei indi- 
cates early replication, whereas numerous single signals is 
characteristic of late replication. This method is easy to 
carry out, and by using a series of probes that surround a 
specific gene, a good picture of the size of each replication 
time zone can be obtained. 

Analysis of the cystic fibrosis gene (CFTR), for 
example, shows that this sequence is normally located 
in a large late-replicating region on human chromo- 
some 7. In active cells, however, a domain of -700 kb 
becomes early replicating, thereby generating a 'new' 
chromosomal mini-band 20 . A similar process takes 
place at the (3-globin locus that is located on chromo- 
some 1 1 , but, in this case, it is a large region of > 1 Mb 
that becomes early replicating in erythroid precursor 
cells 23 . So, it seems that the developmental control of 
replication timing is also carried out in a regional man- 
ner. What emerges from these studies is that chromo- 
some bands represent a basic, but dynamic, feature of 
the genome that is characterized by specific structure 24 , 
function and replication-timing properties. 



assaying for specific sequences. Early-replicating genes 
are concentrated in fractions from the beginning of 
S phase, whereas late-replicating sequences are only 
found in BrdU-labelled DNA from cells in the later 
stages of S phase. 

Analysis of many genes has shown that there is a 
striking correlation between expression and early 
replication timing, but this relationship is certainly 
not absolute. Microarray analysis in Drosophila 
melanogaster has shown that, unlike most genes, there 
are clearly active gene sequences that are located in 
late-replicating regions, as well as inactive genes that 
replicate early in S phase 19 . Replication timing is also 
subject to developmental regulation. Housekeeping 
genes that are transcribed constitutively, for example, 
undergo replication relatively early in S phase, 
whereas many tissue- specific genes replicate late in 
most cell types, but become early replicating in the 
expressing tissue 240 . 



Asynchronous replication timing 

Although most sequences in the genome are expressed 
equally from both alleles, there is a small class of genes 
that are transcribed preferentially from a single allele in 
each cell. In the light of the strong correlation between 
gene expression and replication timing, it is interesting to 
ask whether these genes replicate in an allele-specific 



Genomic imprinting. Genomic imprinting is a classic 
example of asynchronous replication timing. In this 
case, it is always one parental allele, either the maternal 
or the paternal, that is expressed preferentially. One of 
the most important characteristics of imprinted genes is 
that they are always organized into dusters that occupy 
large chromosome domains. The regulation of imprint- 
ing and the coordination between individual genes in 
each domain are dictated at the regional level by means 
of imprinting control centres 25 (BOX i). 
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Figure 2 1 FISH analysis of monoalleDcally expressed genes, a j Fluorescence in situ 
hybridization (RSH) is carried out by using fluorescentfy labelled probes for specific genes. Cells are 
labelled with bromodeoxyuridine (BrdU) (red) to anatyse nuclei h S phase, exclusively. Two single dots 
tyeQow) indicate that the gene has not yet replicated, whereas two double dots are seen in nuclei in 
which both alleles have already replicated. Hybridization dots and DNA molecules are shown 
schematics^ in this figure. Genes that repfcate asynchronously show a high percentage (>30%) of 
nuclei with one single and one double dot It should be noted that the RSH method can only detect 
rep Beaton after sister chromatid segregation, and therefore might not always represent an accurate 
indication of replication timing, b | The K-bcue repfcates asynchronously in all cefl types, In many B 
cefe, one allele has undergone rearrangement with a single variable (V) region reoombined to one 
junction (J) segment The other allele is not rearranged. The nucleus shows dtamidncphenyfindde 
(DAPI) stahing for DNA (blue) and RSH anal/sis with two separate probes, one from the constant (C) 
region (as indicated by the red line; red labelling) and the other from the region between V and J 
(as indicated by the yelow fine; yeJbw labelling) that is eliminated during rearrangement This nucleus 
stxdws asynchronous replication whereby only one aBele has replicated and the other has not The 
yelow signal identifies the unrearranged germfine allele. In >90% of the cases, the single red dot is 
associated with the yellow RSH signal, which indicates that it is usually the earty-rep Beating allele that 
undergoes rearrangement. 



Although the exact mechanism of imprinting has 
not yet been deciphered, it is clear that each individual 
allele must aquire an epigenbtic mark while still in the 
gametes, and that this differential structure then has to 
be maintained in an autonomous manner throughout 
development There is strong evidence that gamete- 
specific methylation has a crucial function in this 
process 2 *, but it is now clear that this modification alone 
cannot explain all aspects of imprinting, and that other 
epigenetic factors must be involved 27 . 

FISH analysis of interphase nuclei has been used to 
study replication timing at imprinted gene loci. For 
most regions in the genome, both alleles seem to repli- 
cate in a coordinated way, so that nuclei show mostly 
(>85%) either single/single or double/double signals for 
sets of alleles. By contrast, for imprinted loci, a large 
number of nuclei have a single dot on one allele and 
double dots on the other, which indicates that one allele 
has replicated before the other (FIG. 2a). This pattern of 
asynchronous replication is regional in nature and 
encompasses the entire imprinted domain 28 - 29 . 



EPIGENEnC 

Any heritable influence on die 
function of a chromosome or 
gene that is Dot caused by a 
change in DNA sequence. 

X-CHROMOSOME INACTTVATION 
The transcriptional inactivation 
ofoneofthetwoX 
chromosomes in female 
embryos. The choice of the 
maternal or the paternal allele is 
random, but is then maintained 
clonafly in subsequent cell 
generations. The inactive X 
chromosome is characterized by 
DNA methylation, late 
replication timing and 
condensed chromatin structure. 



Although there is no direct evidence that replication 
timing itself influences imprinted gene expression, 
there is good indirect developmental evidence that it is 
indeed important. By using FISH to examine replica- 
tion timing during gametogenesis and early develop- 
ment, it was shown that parent-of-origin-specific 
replication timing is 'set up* around the time of meio- 
sis, is maintained through development and is then 
erased early during the next round of gametogenesis 30 . 
This epigenetic feature therefore fulfils all of the criteria 
of a primary imprinting mark (BOX l ). Furthermore, as 
in almost all cases it is the paternal allele that replicates 
early, this also represents a mechanism by which the 
parental allele in each cell can be identified. 
Asynchronous replication timing is controlled by the 
same regulatory centres that define the imprinted chro- 
matin and expression patterns of the two alleles 31,32 , 
which indicates that it is indeed an integral part of the 
domain-wide imprinting process. 

X-chromosome inactivation. Another form of 
monoallelic expression is exemplified by x-chrdmosome 
iNAcnvAnoN in female mammals. In this process, one X 
chromosome in each cell is chosen randomly to 
undergo chromosome-wide inactivation in the late 
blastocyst, and this inactivation is then maintained in a 
clonal manner throughout development. As a result, 
every somatic cell has only one active X chromosome, 
either the maternal or the paternal. This process is 
directed in as by the x-inaotvation centre (Xic), which 
is responsible for a range of molecular events, includ- 
ing the synthesis of an X -inactivation specific tran- 
script (Xist) and chromosome- wide epigenetic 
changes that involve chromatin structure, DNA 
methylation and replication timing 33 . Although the 
inter-relationship between these epigenetic marks is 
not well understood, it is worth noting that the shift to 
late replication timing is one of the first developmental 
changes that is associated with the inactivation 
process, which occurs as early as several days before de 
novo methylation in the mouse 34 . 

Allelic exclusion. In addition to sequences on the X 
chromosome, several other genes have also been found 
to have a random monoallelic expression pattern. In 
these cases, which include immune-system- and olfac- 
tory-receptor genes, the regulation process is also 
regional and characterized by asynchronous replica - 
tion 35,3 *, and, in this sense, they behave in a similar way 
to the X chromosome. Both alleles replicate in an appar- 
ently synchronous manner at early stages of embryo- 
genesis, but at the time of implantation, one allele 
becomes early replicating and the other late replicating 
in each individual cell. This pattern is maintained in a 
clonal manner throughout development, with the 
paternal allele replicating early in some cells and late in 
others 30 - 37 . 

The immune-system-receptor genes are a good 
example of how this epigenetic mark can influence 
function. During lymphoid development, designated 
B- and T-cell-receptor loci undergo rearrangement as a 
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Box 1 I Genomic imprinting 



Genomic imprinting was originally discovered from nuclear 
transplantation experiments in mice, which showed that 
development can only proceed normally if the embryo is 
constructed from both maternal and paternal genome 
components 77,78 . This effect evidently comes about because 
some genes are only expressed from a single allele, either 
maternal or paternal. Further studies showed that these genes 
are organized into clusters that are subject to coordinated 
control 25 . 

Genomic imprinting is controlled by parent- of- origin- 
specific epigenetic marks. Imprinted domains contain an 
imprinting centre (white), which is marked during late 
gametogenesis to identify it as either paternal (purple circle) or ^ 
maternal (red square). Markings might include DNA 
methyiation, replication timing and chromatin structure. 
After fertilization, these epigenetic states are stable and are 
maintained through cell division, but they are erased in early 
gametic cells that are derived from the embryo. After 
implantation, the imprinting centre can induce additional 
epigenetic signals on the domain, and this brings about an 
imprinted expression pattern, with some genes being 
specifically expressed (yellow) on the maternal allele (pink) 
and others on the paternal allele (green). 

Imprinting is also of medical significance, and several genetic 
diseases are associated with defects in imprinted gene regions. 
Chromosome 15, for example, contains a large cluster of 
paternally expressed imprinted genes 79 , and deletion of the 
paternal allele causes Prader-Willi syndrome, a developmental 
disease that is characterized by defects in growth control and 
brain function. Deletion of the maternal allele, on the other 
hand, brings about Angelman syndrome, which is a completely 
different disease that is associated with severe mental 
retardation. In rare cases, these diseases can actually come 
about through the disruption of the imprinting centre itself, 
which leads to misregulation of the entire imprinted domain 79 . 



Early 

gametogenesis 




Somatic cell 



X-INACTIVAnON CENTRE 
(Xic). A as-acting region on the 
X chromosome that produces 
the X-mactr/atkm-specific 
transcript (Xist) that is necessary 
for initiating X-chromosome 
inactivation in female cells. 

ALLELIC EXCLUSION 
The process by which a cell 
(for example, from the immune 
or olfactory system) uses either 
the gene from its maternal 
chromosome or die one from 
the paternal chromosome, 
but not both. 



mechanism for generating the B- and T-ceil-receptor 
repertoires. They do this by choosing individual vari- 
able (V) and diversity (D) genes from a multigenic 
cluster, thereby assembling a specific immune-receptor 
molecule 58 (FIG. 2b). Demethylation and chromatin 
opening are required to make these regions accessible to 
the rearrangement machinery, and occur initially on 
only a single allele in each cell". This might be one of 
the key mechanisms for achieving allelic exclusion 40 . 
Strikingly, in almost all instances, this first rearrange- 
ment event takes place on the early-replicating allele 
(FIG. 2b), which indicates that this decision is actually 
programmed into the immune-receptor loci early in 
development 

Olfaction is another system that involves the selec- 
tion, in each cell, of a single candidate from a multigenic 
array. More than 1,000 individual olfactory-receptor 
genes are organized into numerous clusters that are scat- 
tered throughout the genome In each olfactory neuron, 
however, only a single gene is actually expressed on its 
surface 41 . This is carried out through a complex process 
that stochastically selects a single gene from one devel- 
opmentally predetermined cluster, and only one 



parental allele, either the maternal or the paternal, is 
expressed in each cell 35 . Although the mechanism for 
this type of allelic exclusion is not known, it should be 
noted that all olfactory-receptor loci have been shown to 
undergo random asynchronous replication, and this 
might reflect inherent differences between the two alleles 
in each cell. The choice between X-chromosome-Iinked 
green or red pigment genes in cone cells of the retina 
might also operate by a similar mechanism 42 . So, it 
seems that asynchronous replication timing reflects a 
basic difference in chromatin structure between the two 
alleles. This epigenetic marker is set up early in develop- 
ment, maintained in a clonal manner and can then be 
used as a mechanism of allelic exclusion during terminal 
differentiation. 

As many regions of the genome undergo random 
asynchronous replication, it is interesting to ask 
whether this process is coordinated, and if so, how? To 
this end, several gene regions were analysed for replica- 
tion timing in monoclonal cell lines. Four individual 
markers that are scattered over the full length of chro- 
mosome 6 were all found to replicate in a coordinated 
manner; some clones had all four markers replicating 



28 j JANUARY 2003 | VOLUME 4 



www.nature.com/reviews/moiceKbio 



© 2003 Nature Publishing Group 



REVIEWS 




Figure 3 1 Control of replication timing. A DNA region with two origins (white) is shown. One 
has as-acting sequences that direct earty replication (yellow) and the other has a as-acting 
element that directs late replication (red). At the time of mitosis, the nuclear membrane is 
disrupted, and it then takes some time before the nucleus becomes reorganized in early G1 . 
At the timing decision point (TOP), frsns-acting factors (pink and green ellipses) interact with the 
ds-acting sequences to induce either an early (yellow 'dock 1 ) or a late (red 'clock') epigenetic 
state on the origins. This epigenetic mark could be in the form of nuclear localization or histone 
modification. Once the origin is marked, these as-acting sequences and trans-acting factors 
are no longer required. During early S phase, the early origin is identified by earty-S-phase-specific 
frans-acting factors (blue triangles) that cause it to replicate. In late S phase, a different set 
of trans-acting factors (orange triangles) recognizes the late origins and causes them to fire. 
The epigenetic states of earty and late origins are then erased before the next G1 stage 
of the cell cycle. 



early on the maternal allele, whereas others had all four 
markers replicating early on the paternal allele. 
Asynchronous replicating regions on other chromo- 
somes also showed internal coordination, but different 
chromosomes behaved independently 80 . These findings 
indicate that each autosome might have one or more 
central control elements that function to coordinate 
several asynchronously replicating domains in as, in a 
manner that is similar to the Xic. 

C/s-acting regulatory sequences 

Although the replication time of any genomic 
sequence seems to be a function of both the firing 
time and its distance from the nearest origin — that 
is, late replication can occur from early-firing 
origins 43 — these elements themselves do not usually 
have any function in actually setting the replication 
clock. In yeast, almost all of the authentic origins of 
replication have been identified 13 , and some have 
been subjected to genetic analysis. By moving a late- 
replicating origin away from its subtelomeric posi- 
tion, it is able to initiate replication in early S phase 44 . 
Conversely, the placement of an early-replicating 
origin near the telomere automatically dictates that 
it will fire late in S phase, in both yeast 45 and human 
cells 46 . 



LOCUS CONTROL REGION 
(LCR).A large regulatory 
sequence that harbours several 
elements that control gene 
expression and chromatin 
structure during development. 

CHECKPOINT 
A point at which the 
cell-division cycle can be halted 
until conditions are suitable 
for the cell to proceed to the 
next stage. 



These experiments indicate clearly that replication 
timing is directed by ris-acting elements that can exert 
control over origin sequences that are located in their 
domain of influence, in a way that is possibly similar to 
how enhancers activate gene promoters. As many of the 
late-replicating regions in the yeast genome are actually 
subtelomeric, it is probable that sequences in the telom- 
ere itself are important Indeed, studies with episomal 
plasmids have shown that even short C 13 A tracts can 
drive late replication from a test origin 45 . Furthermore, 
genetic experiments indicate that this might be medi- 
ated by the telomere-binding proteins Ku 47 and Sir 3 
(REE 48). In contrast to this relatively simple system, late 
replication of a multi-origin, 1 30-kb non-telomeric 
region on S. cerevisiae chromosome XIV is directed by a 
combination of at least three sequence elements that 
seem to work in concert 49 . 

The replication origins in animal cells are also sub- 
ject to developmental control. The human and mouse 
P-globin loci, for example, undergo bidirectional repli- 
cation from origin sequences that are located upstream 
of the p-globin gene 50 - 54 . The same origin region is 
used in both non-erythroid cells, which replicate this 
region late in S phase, and in erythroid cells, in which 
the locus undergoes replication in early S phase. 
Experiments in human cells from patients with his- 
panic thallassaemia indicated initially that this process 
is regulated by cis-acting sequences that are located 
within 40 kb surrounding the locus control region 
(LCR) 55 . It has now been shown definitively that 
replication timing in this region is controlled by 
several individual elements both within 23 and outside 56 
the LCR. 

It seems that neither early nor late replication is the 
default state at this locus. Rather, the LCR operates more 
like a toggle switch that can direct, in a dominant man- 
ner, either early replication in erythroid cells or late 
replication in other tissues 23 . This is undoubtedly medi- 
ated by tissue-specific rmns-acting factors, because the 
introduction of a normally late-replicating p-globin 
locus into an erythroid environment is sufficient to 
bring about early replication of the entire region 55 . The 
fact that the LCR is a multifunctional effector that is 
responsible for the developmental regulation of gene 
expression, chromatin structure and replication timing 
indicates clearly that all of these parameters are 
intimately related and centrally controlled 

Control of replication timing 

The replication-timing process seems to involve three 
key components: the epigenetic marking of origins as 
either early or late; the use of cell-cycle- regulated 
protein factors to activate origins at the proper times 
in S phase; and a system of checkpoint genes that ensure 
the replication timing programme is carried out accord- 
ing to schedule. 

Epigenetic marking of replication origins. Although repli- 
cation itself takes place exclusively during S phase, the 
programming of this process begins in the Gl stage of the 
cell cycle when each origin is made competent for firing 
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HCTEROCHROMATTN 
A cytolopcally defined genomic 
component that contains 
repetitive DNA (highly repetitive 
satellite DNA, transposable 
elements and ribosomal DNA 
gene clusters) and some protein- 
encoding genes. 

HISTONE ACETYLTRANSFERASE 
(HAT). An enzyme mat modifies 
histone tails covalenny by adding 
an acetyl group to lysine 
residues, thereby changing their 
structure. 





Figure 4 1 Chromatin formation in early and late S phase 
— a model. During replication, new nucteoeomes (pink) are 
assembled on one molecule of newly synthesized DNA, 
whereas the other retains old nucleosomes (brown). The new 
nucteoeomes might already be pre-acetytated. In early 
S phase, the replication machinery (blue) assembles DNA 
using pro-made nucleosomes (already acetytated). In rate S 
phase, the histone deacetylase HDAC2 is present at 
replication foci, and this causes local deacetylation 72 For this 
reason, late-replicating DNA is assembled with 
underacetylated Nstones. It should be noted that this model is 
based on experiments that were carried out in tissue culture, 
and further studies will be needed to confirm this mechanism 
and establish its actual contribution to gene repression in wo. 



through the formation of a pre-repiication complex. The 
feet that this can only be accomplished in G 1 (REE 57) is an 
important aspect of replication control, because it ensures 
that each origin can only fire once per cycle. In a similar 
way, the instructions for establishing the time when each 
origin will fire in S phase are also provided during Gl. 
This was shown in a set of experiments in yeast, in which 
recombinase recognition sequences were introduced to 
flank a late-replicating origin that was located adjacent to 
the telomere By inducing recombinase activity at specific 



times, the origin fragment could be released from the 
telomeric regulatory sequences at different pre-deter- 
mined points in the cell cycle. The occurrence of recombi- 
nation after the cell had passed through the first few hours 
of Gl produced an episomal DNA fragment that was 
already committed to replicate late in S phase. By contrast, 
detachment before Gl caused the loss of control and the 
resulting plasmid reverted to early replication 44 . These 
results indicate that ds-acting sequences are needed in 
early Gl to set up the replication timing profile, but, 
once this has been done, these elements are no longer 
required (HG. 3). 

Animal cells might use a similar strategy for controlling 
replication timing. This was shown by assaying the replica- 
tion properties of pre-Iabefled animal-cell nuclei that were 
placed in Xenopus egg extracts. Nuclei taken from cells in 
late Gl phase undergo early and late replication in accor- 
dance with their pre-programmed schedule. By contrast, 
nuclei that have not yet passed through a crucial timing 
decision point (TOP) at the beginning of Gl are unable to 
direct organized replication timing 57 . 

Differences in replication timing seem to be associ- 
ated with major variations in nuclear organization. 
Early replication always takes place at distinctive loci 
that are dispersed throughout the interphase nucleus. 
By contrast, replication during middle S phase is asso- 
ciated with the nuclear periphery, whereas late-repli- 
cation foci colocalize with heterochromxttn 58 . It is 
striking that these structural features seem to be set up 
together with the replication time profile in early Gl 
(REFS 57, 59). Although this is suggestive, there is as yet 
no direct evidence that the establishment of replica- 
tion timing is mediated by the nuclear restructuring 
that occurs after each passage of the cell through 
mitosis. Indeed, it has been shown both in yeast and in 
animal cells that, once set up, the replication timing 
instructions remain fixed, even if there is a change in 
nuclear positioning 

Local chromatin structure might also be involved in 
the determination of replication timing. In yeast, for 
example, early replicating origins seem to be packaged 
together with acetyiated histories, whereas late-replicating 
origins are part of a deacetyiated structure Furthermore, 
when a histone acetyltransferase (HAT) was recruited 
artificially to a late-replicating origin, this was sufficient to 
shift the replication event to an earlier time in S phase". 
In animal cells, treatment with a histone deacetylase 
inhibitor also affects the replication timing at specific 
loci 63 . These studies indicate clearly that histone acetyla- 
tion is one of the epigenetic marks that direct replication 
timing, although others, including DNA methylation, 
might also be important in this process 6 ** 65 . 

Activation and regulation of origin firing. In the next 
step of replication-timing control, the signals that 
mark replication origins as being early or late must be 
read and interpreted during S phase. Judging by other 
cell-cycle-controlled processes, this might require the 
generation of a gradient over time, and of protein fac- 
tors that can recognize the predetermined epigenetic 
pattern that is set up during Gl (fig. 3). It has been 
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CHROMATIN 
IMMUNOPREQPnXTION 
(ChIP). A technique that isolates 
sequences from soluble DNA 
chromatin extracts (complexes 
of DNA and protein) by using 
antibodies that recognize 
specific chromosomal proteins. 

CPG ISLANDS 

Sequences (0.5-2 kb) mat are 
rich in the CpG dinudeotide* 
which are mostly located 
upstream to housekeeping and 
some tissue-specific genes. 
They are constitutively 
non-methylated in all animal 
cell types. 



shown that cyclins themselves are implicated in the 
control of replication timing. So, in yeast, for example, 
the B cyclin CIb5 is required specifically for the replica- 
tion of late origins, and, in its absence, the entire 
genome is copied using early-firing origins 
exclusively 66 . Rad53 and Mecl seem to be used to 
coordinate the normal process of replication -timing 
control by delaying late-origin firing until the early 
replicons have completed their process of elonga- 
tion 67,68 . As such, this is an important component of 
the intra-S-phase checkpoint mechanism 69 . 

Effect of replication timing on gene expression 

There is now extensive evidence that indicates that the 
genome- wide pattern of replication timing is regulated 
in an independent and programmed manner, and is not 
merely a secondary consequence of gene expression 23,56 . 
This raises the key question of whether replication tim- 
ing itself can affect transcription. As proposed by the 
'window of opportunity* model, replication causes the 
disruption of chromatin structure and, as a result, pro- 
vides a unique opportunity for nuclear factors to inter- 
act directly with DNA sequences, thereby influencing 
the transcriptional competency 70 . As the protein envi- 
ronment in the nucleus most certainly changes with 
progression through S phase, replication timing could 
have a crucial function in establishing the expression 
profile of the genome. 

A basic prediction of this model would be that 
reporter genes should undergo transcription more effi- 
ciently when introduced into an early-S-phase, as 
opposed to a late-S-phase, nucleus. This idea has recently 
been tested by single-cell microinjection experiments 71 . 
Reporter genes that were injected into early-S-phase 
nuclei were almost tenfold more transcriptionally active 
than if the same gene was injected into late-S-phase 
nuclei And, once established, these transcriptional states 
remain stable when the cell continues cycling. As similar 
results were obtained with various promoter sequences, 
this effect probably cannot be attributed to the involve- 
ment of transcription factors. Instead, it now seems that 
this S-phase specificity is probably mediated by chro- 
matin structure. Indeed, chromatin iMMUNOPREcipnxnoN 
(ChIP) analysis shows that early-injected DNA is pack- 
aged into chromatin that contains acetylated histories, 
whereas late-injected templates are underacetylated 71 . 

Evidence exists that shows a correlation between 
early replication and open chromatin structure, as 
determined by DNase I sensitivity 24 and histone acetyia- 
tion 72 . The injection experiments described above now 
indicate that replication timing itself might have a role 



in establishing this genome- wide pattern. The mecha- 
nism for this process could involve the histone deacetylase 
HDAC2, which has been shown to localize exclusively to 
late-S-phase replication foci 73 . So, although all newly 
replicated DNA might be packaged initially with acety- 
lated histone H4 (refs 74,75), nucleo somes that are 
assembled in late S phase could be deacetyiated specific 
cally. Given that the replication-timing programme is 
reset automatically at the beginning of each cell cycle, 
this might represent a simple system for maintaining 
chromatin states in dividing cells (FIG. 4). 

Role of replication-timing control 

Unlike the genome of simple organisms, in which 
almost all of the genes are expressed ubiquitously, the 
animal genome contains numerous gene sequences that 
are repressed in most cell types. This process is con- 
trolled in a complex way by a combination of sequence- 
specific and global repression mechanisms. In the case 
of DNA methylation, for example, all non-a>G island 
sequences are methylated automatically early in devel- 
opment regardless of their sequence. This pattern brings 
about a closed chromatin conformation, which can be 
passed on from generation to generation by means of a 
semi-conservative maintenance mechanism 76 . So, it 
seems that DNA replication timing might work in a 
similar way. Regions that contain constitutively active 
genes are programmed to be early replicating, whereas 
the rest of the genome replicates late, thereby creating a 
global pattern of repression that can be regenerated 
during each cell cycle Furthermore, replication timing 
can also be regulated, and, in this way, allows tissue- 
specific genes to undergo programmed activation 
during development 

Perspectives 

There are three potential ways to maintain gene repres- 
sion in dividing cells. First, the most direct mechanism is 
genetic in nature and involves DNA sequence elements 
that function to recruit fra/is-acting factors to specific 
regions of the genome after each round of replication. 
Second, DNA methylation, which inhibits gene expres- 
sion by altering the chromatin structure, is a semi-genetic 
mechanism; although it is not based directly on sequence 
information, it does involve covalent changes in DNA 
structure. Third, repUcation-timing control seems to con- 
stitute another mechanism for maintaining functional 
states by directing the restructuring of chromatin after 
DNA replication. This pathway is truly epigenetic, as, 
once established, it is completely independent of local 
DNA sequences. 
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Coordination of the random 
asynchronous replication of 
autosomal loci 



Published online 10 February 2003; doi: 10. 1038/ngl 102 

Random monoallelic expression and asynchronous replication define an unusual 
class of autosomal mammalian genes. We show that every cell has randomly chosen 
either the maternal or paternal copy of each given autosome pair, such that alleles 
of these genes scattered across the chosen chromosome replicate earlier than the 
alleles on the homologous chromosome. Thus, chromosome-pair non-equivalence, 
rather than being limited to X-chromosome inactivation, is a fundamental property 
of mouse chromosomes. 



Monoallelically expressed genes fell into 
three distinct classes, X inactivation in 
female cells is a random process resulting 
in half of the cells choosing the maternal X 
chromosome and half choosing the pater- 
nal X chromosome 1 . By contrast, autoso- 
mal imprinted genes such as Igf2 and H19 
are monoallelically expressed according to 
the parent of origin 2 . The third class, ran- 
domly monoallelically transcribed autoso- 



mal genes, includes the large family of 
odorant-receptor genes 3 as well as genes 
encoding the immunoglobulins 4 , T-cell 
receptors 5 , interleukins 6 * 7 , natural killer- 
cell receptors 8 and pheromone receptors 9 . 

All monoallelically expressed genes share 
the property of asynchronous replica- 
tion 3,7,10 , defined as one allele replicating 
earlier in S phase than the other allele. For 
most other genes, both alleles replicate syn- 



chronously at a defined portion of S phase. 
Asynchronous replication is an epigenetic 
mark that appears before transcription and 
may underlie the differential behavior of 
two alleles of identical sequence 10 . For those 
genes whose transcription is randomly 
monoallelic the asynchronous replication 
is also random. The asynchronous replica- 
tion seems to be established early in devel- 
opment before tissue-specific transcription 
is established 10,11 and is therefore found 
even in tissues in which the genes are not 
expressed 3 . For example, the asynchronous 
replication of odorant-receptor genes has 
been observed in all cell types analyzed, 
including fibroblasts and lymphocytes. The 
presence of asynchronous replication in a 
variety of cell types allowed us to compare 
the replication timing of diverse monoallel- 
ically expressed genes that are expressed in 
different cells. Given that these genes are 
widely dispersed across autosomes 12,13 , we 
sought to establish the extent to which their 
replication asynchrony is coordinated. We 
focused on four autosome pairs, each con- 
taining distinct loci of randomly monoallel- 
ically expressed genes (Fig. la). 
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Fig. 1 Coordination of odorant-receptor asynchro- 
nous replication for individual mouse chromo- U 
some pairs, a. Diagram showing the relative 
positions of odorant-receptor genes (red) and 
other monoallelically expressed genes analyzed in 
this study (blue) along with the location of control 
genes (black). Centromeric ends are at the top. 
b-d, Two-color FISH analysis was done on a popu- 
lation of mouse embryonic fibroblasts. Blue repre- 
sents DAPI staining of chromatin, b, Analysis of 
Chromosome 11. The Cy3-labeled probe (red) 
identifies the Olfrl odorant-receptor gene and 
the FfTC-labeled probe (green) identifies OlfrW. 

The double-dot signals for the two probes in these Tg(Olfr19)Y1 1 

images are on the same chromosome, indicating 
coordination of these two distant loci (30 of 30 

cells counted). A control probe, Myh4, located 2 6 7 11 

between the O/frfand Olfr10 loci, is synchronously 
replicating (9% single dot-double dot pattern). 

c. Similar analysis of Chromosome 7 for two odor- 
ant-receptor genes from distinct clusters, OlfrS 
(red) and Olfr41 (green), showing coordination 
(32 of 37 cells counted). Control probes between 
the 0/fr5and Olfr41 loci Included the gene encod- 
ing tyrosinase, which is synchronously replicated 
in wild-type cells (13% single dot-double dot pat- 
tern), and the asynchronous but imprinted gene 
Snrpn. As expected, Snrpn did not show coordina- 
tion with an odorant receptor (data not shown). 

d. Similar analysis showing that two odorant- 
receptor genes from distinct clusters of Chromo- 
some 2 were coordinated (33 of 36 cells counted): 
Olfr48 (red), Olfr3 (green). A control probe, DM, 
is synchronously replicating (10% single dot-dou- 
ble dot pattern). e,f, FISH analyses of line A. 5 (see 
Supplementary Table 1 online) detected with a 
p-geo probe (green) that identifies the maternal 
Chromosome 11. Examples of cells probed with 
Olfrl (red, e) and Olfrl 0 (red, f) are shown. Both 
of these odorant-receptor genes are maternal 
early-replicating In line A.5 (for data on all similar 

cell lines, see Supplementary Table 2 online). Olfrl Olfrl 0 

g,h, Lack of coordination between Chromosomes 
7 and 1 1. In each case, the maternal chromosome 

is marked by the green probe. The maternal Chromosome 11 has a p-geo insertion and the paternal Chromosome 7 has a deletion at the tyrosinase locus. 
Line A.I (see Supplementary Table 3 online) shows lack of coordination between paternally early-replicating Olfrl on Chromosome 11 (g) and maternally 
early-replicating Olfr41 on Chromosome 7 (n). 
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Fig. 2 A variety of monoa Helically expressed 
genes show coordination, a-d. Analyses of a 
clonal cell line that has a marked maternal copy 
of Chromosome 6 ({J-geo transgene, green). 
Probes for the fgk cluster (a), a large V1R 
pheromone receptor cluster (VNO-61; b), the 
Tcrb (c) and an odorant-receptor cluster con- 
taining Offr47 (cO are shown (red). Line F.I has 
all four genes maternally early replicating. Line 
C.1 (see Supplementary' Fig. 2 online) has all 
four paternally early replicating. Two control 
probes, Hoxa and CdA, each showed synchro- 
nous replication (12% and 18% single dot-dou- 
ble dot pattern, respectively). «, Analyses of 
uncloned fibroblasts (similar to the analyses 
shown in Fig. "Ib-d) also showed coordination 
of an odorant-receptor gene cluster on Chro- 
mosome 6 containing 0/fr47 (red) and the VNO 
V1R cluster (green; 30 of 32 cells), f, Same 
analysis as In e, but comparing coordination of 
Olfr47 with Tcrb (33 of 34 cells), g, A similar 
population analysis showed coordination of 114 (red) and OlfrlO (green) on Chromosome 1 1 (25 of 26 cells), h, Example of coordination of endogenous 0lfr48 
(green) and transgenic Tg(Olfr19)Y1 1 (red) on Chromosome 2 (30 of 31 cells; P < 0.0000001). 



Asynchronous replication can be 
assayed by fluorescence in situ hybridiza- 
tion (FISH) analysis of interphase nuclei 14 . 
Replicated loci are visualized as a double- 
dot hybridization signal, whereas unrepli- 
cated loci are visible as a single dot. 
Asynchronously replicating genes present 
a single dot-double dot pattern in 30-40% 
of S-phase cells, whereas synchronously 
replicating genes present this pattern in 
roughly 10-15% of S-phase cells 14 . 
Although the FISH assay is only an indi- 
rect assessment of replication timing, 
asynchronous replication observed with 
this assay has been corroborated by direct 
measurements of replication timing (refs. 
10,1 1; see Supplementary Note 1 and Sup- 
plementary Fig. 1 online). To assess coor- 
dination of distant loci on a given 
chromosome, we used two-color FISH 
analysis to examine two genes simultane- 
ously and scored cells that presented a sin- 
gle dot-double dot signal for both genes. If 
the two genes are coordinated, and are 
replicated during a overlapping portion of 
S phase, the double dots for both genes 
should be on the same chromosome 
(maternal or paternal) and therefore close 
to each other in the nucleus. If the two 
genes are not coordinated, the double dots 
for both genes should be on the same chro- 
mosome only 50% of the time. 

Using this approach we assessed the 
potential for coordination of asynchro- 
nous replication in wild-type primary 
mouse embryonic fibroblasts, analyzing 
two distinct odorant-receptor loci on 
Chromosome 11 that are 14 cM apart. 
Notably, we observed coordination in all 
of the 30 cells in which both probes pre- 
sented the single dot-double dot pattern 
(Fig. lb). Similarly, we observed coordi- 
nation for two distant loci on Chromo- 
some 7 (in 32 of 37 cells) and for two 
distant loci on Chromosome 2 (in 33 of 



36 cells; Fig. \c,d). As expected, in each 
case, genes between the distinct odorant- 
receptor loci replicated synchronously. 
These data indicate that odorant-receptor 
genes have long-range coordination of 
their replication asynchrony for the three 
autosomes examined. 

We examined whether coordinated 
asynchronous replication of odorant- 
receptor genes, once established, is herita- 
ble in the progeny of a given cell. We 
derived clonal cell lines from embryonic 
and adult mice with distinguishable 
maternal and paternal chromosomes for 
Chromosomes 7 and 11. We analyzed 
seven cell lines for Chromosome 7 and 
eight cell lines for Chromosome 11. In 
some cell lines we consistently observed 
early replication of the maternal allele 
(Fig. le), and in the other cell lines we 
consistently observed early replication of 
the paternal allele (see Supplementary 
Tables 1 and 2 online). These analyses 
indicate that for each odorant-receptor 
gene, the random choice of one of the two 
alleles to replicate early, once established, 
is heritable. Analyses of these clonal cell 
lines also confirmed coordination of asyn- 
chronous replication along a given chro- 
mosome. For all the cell lines analyzed, 
both loci replicated the same parental 
allele early (Fig. le/and see Supplemen- 
tary Tables 1 and 2 online). 

To test for genome-wide coordination, 
we analyzed clonal cell lines derived from 
mice carrying marks allowing us to distin- 
guish the parental origins of two chromo- 
some pairs at a time. We used FISH 
analysis to compare the replication timing 
of odorant-receptor genes on Chromo- 
somes 6 and 11 with odorant-receptor 
genes on Chromosome 7 in pakwise com- 
parisons. Odorant-receptor genes on dif- 
ferent chromosomes were not coordinated 
in their replicative asynchrony, we 



observed all possible outcomes (Fig. \g)t 
and see Supplementary Table 3 online). 
Thus, rather than genome-wide coordina- 
tion, asynchronous replication of odorant- 
receptor genes seems to be coordinated 
only at the level of each chromosome pair. 

To explore whether other randomly 
monoallelically transcribed genes are also 
coordinated in their replicative asyn- 
chrony, we examined clonal cell lines in 
which we can distinguish the maternal 
and paternal copies of Chromosome 6. We 
observed asynchronous replication of the 
Igk constant region, a V1R pheromone- 
receptor gene and Tcrb (Fig. 2a-c). 
Notably, all three of these loci were coor- 
dinated with each other as well as with an 
odorant-receptor gene cluster on Chro- 
mosome 6 (Fig. 2a-d). In some clonal cell 
lines the maternal alleles of all four genes 
replicated early (Fig. 2a-d), and in others 
the paternal alleles of all four genes repli- 
cated early (see Supplementary Fig. 2 
online). Analyses of uncloned populations 
of cells (similar to the analyses presented 
in Fig. Xb-d) also showed coordination of 
genes on Chromosome 6 (Fig. 2ej). Simi- 
lar population analyses showed that on 
Chromosome 1 1 , the gene encoding inter- 
leukin-4 (114) was coordinated with the 
odorant-receptor genes (Fig. 2g). These 
data, taken together, indicate that all ran- 
domly asynchronously replicated genes 
examined are coordinated along each 
chromosome. 

One question arising from these observa- 
tions is whether the coordination mecha- 
nisms used by different chromosomes can 
communicate with each other if sequences 
from different chromosomes are artificially 
placed in cis. We analyzed a small odorant- 
receptor translocation that we created arti- 
ficially: a 300-kb odorant receptor- 
containing YAC transgene derived from 
Chromosome 16 that is integrated on 
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Chromosome 2 (Tg(0lfrl9)Yll). Weprevi- 
ously showed that this transgenic odorant- 
receptor locus undergoes asynchronous 
replication 15 . Two-color FISH analysis 
showed that the transgene was coordinated 
in its asynchronous replication with the 
endogenous odorant-receptor loci on 
Chromosome 2 (Fig. 2h), suggesting simi- 
larities in the mechanisms governing allele- 
specific replication timing on different 
chromosomes. 

Here, we present data indicating that 
randomly monoallelically expressed genes 
coordinate their asynchronous replication 
within each chromosome pair. Scattered 
genes along a given chromosome are coor- 
dinated in their asynchronous replication 
timing in ris, leaving unaffected the bulk of 
the genes (which are synchronously repli- 
cated or, in rare instances, asynchronous 
but imprinted). Asynchronous replication 
is established early in development 10,11 and 
maintained in the progeny of individual 
cells in a clonal manner (ref. 10; Fig. \e-h 
and see Supplementary Tables 1 and 2 
online). Randomly monoallelically 
expressed genes are expressed in different 
cells of a given cell type or in different cell 
types. Therefore, coordination of replica- 
tion timing does not imply coordination of 
transcription of distinct gene families and 
is instead a consequence of the early devel- 
opmental mechanisms that establish asyn- 
chronous replication. Each gene family 
probably makes use of asynchronous repli- 
cation (and the differences in chromatin 
structure that it reflects) in the complex 
gene regulation that characterizes these 
gene families. In the case of the 
immunoglobulin genes, we have recently 



shown that early replication correlates with 
the allele that will first undergo rearrange- 
ment and therefore provides a basis for the 
establishment of allelic exclusion 10 . 

X in activation has been known for 
decades. Our data indicate that chromo- 
some-pair non-equivalence is also found on 
autosomes and thus is a general, fundamen- 
tal property of chromosomes that affects a 
large number of loci dispersed throughout 
the genome The autosomal non-equiva- 
lence we observe is similar to that observed 
with X inactivation, except that a larger 
fraction of the genes on the X chromosome 
are affected. Other similarities in the under- 
lying mechanisms of X inactivation and 
autosomal non-equivalence may emerge 
with further investigation. 

Note: Supplementary information is avail- 
able on the Nature Genetics website, 
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Commentary 

Cancer epigenetics takes center stage 

Andrew P. Fein berg 
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Next year will mark 20 years since I 
developed a Southern blot showing 
altered DNA methylation in cancer. This 
discovery (1) was met with some skepti- 
cism, primarily because it was thought that 
aberrant methylation in cancer was an 
epiphenomenon, somehow linked to a 
generalized disruption of gene regulation 
in cancer cells and arising after the cancer, 
rather than playing a causal role itself. 
This essay will address how cancer epige- 
netics has overcome these objections, and 
a report in this issue by Nakagawa et al (2) 
adds significantly to this argument. 

Epigenetics is defined as modifications 
of the genome, heritable during cell divi- 
sion, that do not involve a change in the 
DNA sequence. Examples include meth- 
ylation induced premeiotically in Ascobo- 
lus, repeat-induced gene silencing and 
paramutation in plants, mating type si- 
lencing and telomere silencing in yeast, 
position effect variegation in Drosophila, 
and genomic imprinting in mammals and 
flowering plants. There are several fea- 
tures that distinguish epigenetics from 
conventional genetic mechanisms: revers- 
ibility; position effects, i.e., the ability to 
act over unexpected distances larger than 
a single gene; apparent mutations at un- 
expectedly high frequency; and the in- 
volvement of gene domains. One common 
thread to most epigenetic phenomena is 
DNA methylation, a covalent modifica- 
tion of the C5 position of cytosine. This 
methylation pattern is stably maintained 
at CpG dinucleotides by a family of DNA 
methyl transferases that recognize hemim- 
ethylated CpG dinucleotides after DNA 
replication. DNA methyltransferases be- 
long to multiprotein complexes, and they 
contain sequence motifs for multiple such 
interactions, including interactions with 
chromatin components, some of which have 
been directly identified (3). Another com- 
mon thread to epigenetics is a link to tran- 
scriptional regulation, generally involving 
gene silencing. DNA methylation in partic- 
ular is generally but not exclusively linked to 
transcriptional silencing, including methyl- 
ation induced premeiotically, paramutation, 
and mammalian gene silencing. 

Epigenetic alterations in cancer include 
global hypomethylation (4), hypomethyla- 
tion of individual genes (1), and hyper- 



methylation of CpG islands (5), CpG-rich 
sequences in the promoters of housekeep- 
ing genes that are generally protected 
from methylation. This hypermethylation 
may lead to aberrant silencing of tumor 
suppressor genes (6). In addition, we and 
others have discovered loss of imprinting 
(LOI) in cancer (7, 8). Genomic imprint- 
ing, the subject of the report by Nakagawa 
et al (2), is an epigenetic modification of 
a specific parental allele of a gene, or the 
chromosome on which it resides, in the 
gamete or zygote, leading to differential 
expression of the two alleles of the gene in 
somatic cells of the offspring. LOI involves 
loss of the normal pattern of expression of 
a specific parental allele, and in cancer it 
can lead to activation of growth-promot- 
ing imprinted genes such as insulin-like 
growth factor II (IGF2) (7, 8), as well as 
silencing of potential tumor suppressor 
genes such as /?57 KIP2 (9) mdARHl (10). 

Furthermore, we found that LOI can 
occur in the normal colonic mucosa of 
colorectal cancer patients with LOI in 
their tumors (11), overcoming the objec- 
tion that epigenetic alterations are simply 
late consequences of neoplasia. This LOI 
was linked to cases showing microsatellite 
instability (MSI) in the tumors (11). MSI 
is a form of genetic instability found in 
patients with hereditary nonpolyposis 
colorectal cancer and caused by defects in 
DNA mismatch repair (12, 13). MSI oc- 
curs much more commonly in sporadic 
nonfamilial colon cancer, affecting about 
25% of such patients. However, these pa- 
tients do not have mutations in mismatch 
repair genes (14). One potential cause of 
MSI in these sporadic cancers is hyper- 
methylation and epigenetic silencing of 
the hMLHl mismatch repair gene (15), a 
target of conventional mutations in hered- 
itary nonpolyposis colorectal cancer (16, 
17). LOI of 1GF2 has also been previously 
linked to increased methylation in embry- 
onal and other tumors, specifically at a 
CpG island that represents a differentially 
methylated region (DMR) upstream of 
the maternal H19 gene; methylation of the 
DMR in turn regulates the silencing of 
the IGF2 gene on the same chromosome 
(18, 19). 

Nakagawa et al (2) now confirm the 
original study of Cui et al. that LOI occurs 




Fig. 1. The interrelationship of cancer genetics 
and epigenetics. Rather than a traditional Venn 
diagram, cancer genetics and epigenetics are 
drawn as a yin-yang. For example, loss of imprint- 
ing might be caused by genetic disruption of CTCF 
or by altered methylation of the DMR (illustrated). 
Other examples of overlap or in which the distinc- 
tion between genetics and epigenetics are blurred 
are provided in the text. 

in both tumor and normal tissue of pa- 
tients with MSI-positive colorectal cancer 
(11), a result also confirmed earlier by 
Nishihara et al (20). Furthermore, Naka- 
gawa et al find that LOI was not a feature 
of tumors in patients with a germline 
mismatch repair gene mutation (2), indi- 
cating that LOI is not merely a conse- 
quence of a mismatch repair defect. The 
study of Nakagawa et al (2) also helps to 
close the circle among cancer, LOI, and 
DNA methylation, by demonstrating di- 
rectly that sequences within the H19 DMR 
are specifically methylated in these tu- 
mors. Not surprisingly, these same pa- 
tients show hypermethylation of other 
CpG islands throughout the genome, as 
generalized hypermethylation and MSI 
have been previously linked (21, 41). 
Methylation of the H19 DMR was also 
observed in the matched normal tissue of 
patients with LOI (2), although it should 
be noted that this occurred at a lower 
frequency than LOI in these tissues, and 
only partial methylation was observed. 



See companion article on page 591. 
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Furthermore, generalized hypermethyl- 
ation of CpG islands is not present in the 
normal tissue of these patients (2), con- 
sistent with the idea that LOI precedes a 
generalized disruption of CpG island 
methylation (11). 

The present study (2) also offers an 
intriguing mechanistic hypothesis to ex- 
plain the relationship between H19 
DMR methylation and LOI in these pa- 
tients, as the methylated nucleotides in- 
clude those to which the chromatin in- 
sulator CTCF has been shown to bind 
specifically in regulating genomic im- 
printing. Whether the partial methyl- 
ation seen in their study is sufficient 
to disrupt CTCF binding remains to be 
proven. Nevertheless, the study calls at- 
tention to this remarkable highly con- 
served multifunctional protein, first dis- 
covered by Lobanenkov and colleagues 
as a multivalent transcription factor 
(22-24) that also serves as a chromatin 
insulator. In this capacity, CTCF binds 
specifically to the H19 DMR in vivo (25) 
and in vitro (26-28) when it is unmeth- 
ylated, separating IGF2 from its en- 
hancer and allowing monoallelic DMR 
methylation-dependent expression of 
IGF2 (25-28). Lobanenkov and col- 
leagues (including us) have found muta- 
tions in CTCF in diverse tumors, that 
selectively impair binding to target se- 
quences, altering the functional spec- 
trum of the protein and, as well as meth- 
ylation of CTCF binding sites in tumors 
(G. N. Filippova, D. I. Loukinov, E. M. 
Pugacheva, J. E. Ulmer, J. M. Moore, Y. 
J. Hu, H. Moon, J. Breen, C.-F. Qi, P. E. 
Grundy, et al., unpublished work), sug- 
gesting a general role for CTCF in cancer. 

The potential link to CTCF suggested 
by this study also calls our attention to the 
link among DNA methylation, epigenet- 
ics, and chromatin. Boveri, the father of 
cancer genetics, described a generalized 
disturbance of chromatin that distin- 
guished cancer cells from normal cells 
(30). Although his writing preceded our 
understanding of DNA and thus he was 
not truly distinguishing genetics and epi- 
genetics, his thoughts were based on his 
observation of widespread disruption of 
chromosomal organization and nuclear 
structure, akin to what we might call chro- 



matin today. In this regard, a gem in the 
recent studies of CTCF was the observa- 
tion of Ohlsson and colleagues that CTCF 
binding may depend on nucleosome phas- 
ing (31), suggesting that imprinting and 
methylation may both ultimately be bound 
to the assembly of DNA into organized 
structures, 

A clue to the link between MSI and 
epigenetics may be provided by another 
sometimes overlooked common thread 
in epigenetics, namely DNA replication. 
For example, repeat-induced gene si- 
lencing is thought to be propagated 
through hemimethylated intermediates 
during DNA replication; silencing in 
yeast depends on an origin of replication 
complex; and mating type silencing, po- 
sition effect variegation, and genomic 
imprinting are all linked to delayed rep- 
lication timing. Given that MSI is attrib- 
utable to defective replication-linked 
mismatch repair, necessitating that the 
cell distinguishes between parent and 
daughter strands, all of these phenomena 
may involve disrupted chromatin. 

The studies of Cui et al. (11), Nishihara 
et al. (20), and Nakagawa et al. (2) sug- 
gest a new and provocative view of the 
timing of epigenetic changes in cancer. It 
should be noted that LOI was observed 
in these studies generally throughout the 
colon, not just at the site of the tumor. 
Although colorectal cancer in particular 
involves a series of genetic alterations in 
the evolution of an advanced metastatic 
tumor, elegantly described by Vogelstein 
(32) and others, perhaps the likelihood 
of developing clinical cancer when a mu- 
tation arises depends in part on a preex- 
isting epigenetic defect affecting much or 
all of the normal colonic mucosa. Why 
would this be? Studies of transgenic mice 
with constitutive biallelic expression of 
IGF2, comparable to LOI, show reduced 
apoptosis and increased tumor forma- 
tion on introduction of an oncogenic 
transgene (33, 34). Perhaps preexisting 
LOI alters the balance between growth 
and apoptosis when conventional onco- 
genic mutations arise in the colon. Al- 
though this idea is admittedly specula- 
tive, these studies (2, 11, 20) nevertheless 
demonstrate such a field defect in non- 
tumor tissue, and from an epidemiolog- 



ical perspective, what distinguishes a 
cancer patient from a noncancer patient 
may be such epigenetic alterations in- 
volving DNA methylation and imprint- 
ing. This hypothesis will require direct 
confirmation that the epigenetic alter- 
ations temporally precede the genetic 
changes and the tumors themselves, a 
subject of intense current clinical study. 
Such studies may have profound clinical 
significance, because they might offer 
the opportunity for detection of large 
numbers of patients in the general pop- 
ulation at increased risk of colorectal 
cancer, and the eventual possibility of 
enhanced surveillance or even chemo- 
prevention in such patients. 

I conclude by noting that the distinc- 
tion between cancer genetics and epige- 
netics has blurred considerably in recent 
years (Fig. 1). Many conventional "ge- 
netic" mechanisms directly affect pro- 
teins that regulate chromatin, such as the 
rearrangement of the trithorax family 
member ALL in common childhood leu- 
kemia (29, 35), mutation of the chroma- 
tin remodeling complex core member 
Snf5 in rhabdoid tumors (36), and rear- 
rangement of a candidate histone acetyl- 
transferase in acute myeloid leukemia 
(37). In addition, physical interactions 
nave been found between critical tumor 
genes and chromatin proteins, such as Rb 
and histone deactylase (38, 39), and the 
promyelocytic leukemia nuclear body 
protein P100 with heterochromatin pro- 
tein HP-1 (40), whose Drosophila homo- 
logue is also a suppressor of position 
effect variegation. While less is known of 
the mechanism of epigenetic alterations 
in cancer, CTCF itself offers a striking 
example of convergence of traditional 
genetics and epigenetics, as the same 
protein regulates genomic imprinting as 
well as the "traditional" oncogene c-mvc 
(23). We may well find that the geneti- 
cists and the epigeneticists converge on 
Boveri's definition of cancer as a disease 
of "the chromatin." 

I thank R. Ohlsson, V. Lobanenkov, H. Cui, J. 
Ravenel, and E. Niemitz for thoughtful com- 
ments. This work was supported by National 
Institutes of Health Grant CA65145. 
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